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Abstract 

Herein, we applied statistical physics to study incomes of three (low-, medium- 
and high-income) society classes instead of the two (low- and medium-income) 
classes studied so far. In the frame of the threshold nonlinear Langevin dy- 
namics and its threshold Fokker-Planck counterpart, we derived a unified 
formula for description of income of all society classes, by way of example, of 
those of the European Union in year 2006 and 2008. Hence, the formula is 
more general than the well known that of Yakovenko et al. That is, our for- 
mula well describes not only two regions but simultaneously the third region 
in the plot of the complementary cumulative distribution function vs. an 
annual household income. Furthermore, the known stylised facts concerning 
this income are well described by our formula. Namely, the formula provides 
the Boltzmann-Gibbs income distribution function for the low-income society 
class and the weak Pareto law for the medium-income society class, as ex- 
pected. Importantly, it predicts (to satisfactory approximation) the Zipf law 
for the high-income society class. Moreover, the region of medium-income 
society class is now distinctly reduced because the bottom of high-income 
society class is distinctly lowered. This reduction made, in fact, the medium- 
income society class an intermediate-income society class. 
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1. Introduction 



For over two decades, physics oriented approaches have widely been de- 
veloped to explain different economic processes (and refs. therein). 
Those approaches aim at formulating well fitted unbiased indicators of social 
and economic phenomena. One of their key issues is the income of society 
analysis using methods of statistical physics. The main goal of this eco- 
nomic issue is to unravel and describe mechanisms of societies' enrichment 
or impoverishment. 

The first successful attempt in this socio-economic field was made by the 
legendary economist and sociologist Vilfredo Pareto He demonstrated 

that the distribution functions of individual incomes in different countries 
within stable economy are universal being manifested by a power law. This 
law is called the weak Pareto law. He emphasised that this law could not 
resemble the distribution functions obtained if the gain and accumulation of 
income were random. As a possible origin of this law, Pareto indicated a 
self-similarity structure of societies. 

Pareto's economic discoveries initiated attempts of analytical descriptions 
of incomes of the societies and inspired an avalanche of related research 
works [21, Is], |5|-0, 10-24|. Amon g th em, particularly significant are those of 
the economist Robert Gibrat (tI. [loi 11]. He found that the complementary 



cumulative distribution function of the Pareto distribution is insufficient to 
describe empirical data within the whole range of the income. Trying to find 
a functional form that could account for these data, he proposed a rule called 
the Rule of Proportionate Growth (see below for details). 

Furthermore, the income of societies was analysed by David Champer- 
owne, who constructed a stochastic model simulating the Pareto power law 



12| and also by Benoit Mandelbrot who described several useful properties 



of random variables subjected the Pareto distribution js], lof. 

In the recent decade, a large number of studies were performed aiming 
at constructing of models, which (to some extend) would well replicate the 
observed complementary cumulative distribution functions of individual in- 
comes. Among them, the most significant seems to be the Clementi-Matteo- 



Gallegati-Kaniadakis approach |17|. the Generalized Lotka-Volterra Model 



the Boltzmann-Gibbs law 13-16 , and the Yakovenko et al. model 



f 

et al. has been developed [2l|. It involves complex economic justification 



Very recently, a mathematical model similar to that of Yakovenko 



for microscopic stochastic dynamics of wealth. However, none of the above 
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attempts to find an analytical description of the income structure solves the 
principal challenges, which concern: 

(i) the description of the annual household incomes of all society classes 
(i.e. the low-, medium-, and high-income society classes) by a single 
unified formula and 

(ii) the problem regarding corresponding complete microscopic (microeco- 
nomic) mechanism responsible for the income structure and dynamics. 

In our considerations presented herein, we used Boltzmann-Gibbs law 
and Yakovenko et al. model to derive a uniform analytical formula 
describing income of all three society classes. 



2. Extended Yakovenko et al. model 

In accord with an effort outlined above, we compared the empirical data 
of the annual household incomes in the European Union (EU), including 
Norway and Iceland, with predictions of our theoretical approach proposed 
herein. This approach is directly inspired by the Yakovenko et al. model. 
By using the same assumptions, however, we generalised this model to solve 
the principal challenges (i) and (ii) indicated above. 

We used data records from the Eurostat Survey on Income and Living 
Conditions (EU-SILC) jisj, by way of example for years 2006 and 2008 
26 , 2?! (containing around 150 and 200 thousand empirical data points. 



respectively). However, these records contain (as all other records) only few 
data points concerning the high-income society class, i.e. the third region 
in the plot of the complementary cumulative probability distribution func- 
tion vs. annual household income. To consider the high-income society class 
systematically, we additionally analysed the effective income of billionaires 
in the EU by using the Forbes 'The World's Billionaires' rank jisf. The 
term 'billionaire' used herein is equivalent (as in the US terminology) to the 
term 'multimillionaire' used in the European terminology. Since we consider 
wealth and income of billionaires in euros, we recalculated US dollars to eu- 
ros by using the mean exchange rate at the day of construction of the Forbes 
'The World's Billionaires' rank. 

We were able to consider incomes of three society classes thanks to the 
following procedure. 



3 



Firstly, we selected EU billionaires' wealth from the Forbes 'The World's 
Billionaires' rank, for instance, for four successive years 2005 to 2008. 

Secondly, we calculated the corresponding differences between billion- 
aires' wealth for the successive years. We assumed that their incomes 
are, in fact, proportional to these differences. For instance, we cal- 
culated the billionaire incomes for year 2006 by taking the difference 
between their wealth in years 2006 and 2005. We made it analogously 
for year 2008. However, we took into account only biUionaires who 
gained effective incomes (neglecting those, who suffered from income 
losses) . 

Subsequently, having so calculated incomes for the high-income soci- 
ety class, we joined them (separately for years 2006 and 2008) with 
the corresponding EU-SILC datasets. By using so completed datascts, 
we then constructed the initial empirical complementary cumulative 
distribution function for years 2006 and 2008. For that, we used the 
well known WeibuU recipe (see below for details). However, this direct 
approach shows a wide gap of incomes inside the high-income society 
class resulting in a horizontal line of the complementary cumulative dis- 
tribution function. This gap separates the first segment belonging to 
the high-income society class, consisting of all data points taken from 
the EU-SILC dataset (only 8 for 2006 and 6 for 2008), from the second 
segment, consisting of remaining data points (76 for 2006 and 96 for 
2008), which also belong to the high-income society class but are taken 
from the Forbes dataset. 

In the final step, we eliminated this gap by adopting the assumption 
that the empirical complementary cumulative distribution functions 
(concerning the whole society) have no horizontal segments. That is, 
we assumed that statistics of incomes is a continuous function of in- 
come. Hence, we were forced to multiply the billionaire incomes from 
Forbes dataset by the properly chosen common proportionality factor. 
This factor was equal to 1.0 x 10^^ for both years, as we assumed the 
requirement of full overlap of the first (above mentioned) segment by 
the second segment. This assumption leads to a unique solution (up 
to some neghgible statistical error) for this proportionahty factor. We 
found that this factor was only a slowly-varying function of time (or 
years) . 
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Hence, we received the data record containing aheady a sufficient num- 
ber of data points for all society classes, including the high-income society 
class. Although the Forbes empirical data only roughly estimate the wealth 
of billionaires, they quite well establish the billionaires' rank, thus sufficiently 
justifying our approach. This is because our purpose is to classify billion- 
aires to concrete universality class rather than finding their total incomes. 
Our procedure of linking data from two different bases does not violate this 
universality class. 

The basic tool of our analysis is an empirical complementary cumulative 
distribution function being typical in this context. We calculated it accord- 
ing to the standard two-step procedure. For that, ffist, the income empirical 
data were ordered according to their rank, i.e. from incomes of the richest 
households to those of the poorest. Next, in accordance with the well known 



WeibuU formula 29|, |30| , we calculated the ratio where I is the position 
of the household in the rank and n is the size of the empirical data record. 
This ratio directly determines the required fraction of households of the in- 
come higher than that related to a given household position / in the rank. 
The complementary cumulative distribution function obtained that way is 
sufficiently stable. Furthermore, it does not reduce the size of the output 
compared to that of the original empirical data record. 

2.1. Hint to the Yakovenko et al. model 

Let m be an influx of income per unit time to a given household. We 
treat m as a variable obeying stochastic dynamics. Then, we can describe 
its time evolution by using the nonlinear Langevin equation j^, [s], [33[: 

^ = -A{m) + C{m)r]{t). (1) 

Here, A{m) is a drift term and 7]{t) is a white noise, where the coefficient 
C{m) is its m-dependent amplitude. As we prove below, already white noise 
is herein sufficient to produce two different power-laws. Obviously, general- 
isation with respect to power-law noise is also possible and very promising 



31|, |32]. Noteworthly, jump effects are important in the financial and social 
phenomena because they naturally produce power-law tails. However, cor- 
respondence between Ito and Stratonovich representations is then no longer 
trivial. 
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Notably, the above nonlinear stochastic dynamics equation is equivalent 
to the following Fokker-Planck (continuity) equation for the probability dis- 



tribution function 33 



= -^^JirnA (2) 

where the flux density of probability (in the Ito representation j33[) is 

d 

J(m, t) = -A{m)P{m, t) - [B{m)P{m, t)] . 

(3) 

Here, B{m) = C^(m)/2 while P{m,t) is the temporal income distribution 
function. In general, functions A{m) and B{m) can be additionally deter- 
mined by the first and second moment of the income change per unit time, 
respectively, only if these moments exist. 

The equilibrium solution of Eq. Peq, defined by vanishing of J{m,t) 



33 , takes the form: 



p / ^ const / r Ajm') ^ A 

-fcal"^) = -7^, — rexp — / -— - — -am (4) 

where minit is the lowest household income and const is a normalisation 
factor. Fortunately, both Ito and Stratonovitch representations jssj give 
almost the same equilibrium distribution function. These representations 
differ only by some preexponential factor. 

Following the Yakovenko et al. model [2I, lij , we can assume that changes 
of income of the low-income society class are independent of the previous 
income gained. This assumption is justified because the income of households 
belonging to this class mainly takes the form of wages and salaries. The 
stochastic process associated with the mechanism of this kind is called the 
additive stochastic process. In this case, coefficients A{m) and B{m) take, 
obviously, the form of positive constants 

A{m) = Ao, B{m) = Bq. (5) 

This choice of coefficients leads to the Boltzmann-Gibbs law with the expo- 
nential complementary cumulative distribution function 

n(m) = j Peq{m') dm' = exp (- ^ ^'^ j ■ (6) 
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In Equation distribution function is characterised by a single parameter, 
i.e. an income temperature T = Bq/Aq, which can be interpreted in this case 
as an average income per household. 

For the medium- and high-income society classes, we can assume (again 
following Yakovenko et al. p, Isj) that changes of income are proportional to 
the income gained so far. This assumption is also justified because profits go 
to the medium- and high-income society classes mainly through investments 
and capital gains. This type of stochastic process is called the multiplicative 
stochastic process. Hence, coefficients A{m) and B{m) obey the proportion- 



ality principle of Gibrat [10|, lUl 



A{m) = am, B{m) = hra^ <^ C{m) = \/2bm, (7) 

where a and b are positive parameters. By using the equilibrium distribu- 
tion function (jl]), we arrive in this case to the weak Pareto law with the 
complementary cumulative distribution function [2I, Isi 0|: 

n(m) = / Pcqim') dm' = i . (8) 

J in V^sp/ 

Here, msp is a scaling factor (depending on a, b, and const) while a = 1 + a/b 
is the Pareto exponent. The ratio of the a to 6 parameters can directly be 
determined from the empirical data expressed in the log-log plot (by using 
their slopes). 

As Yakovenko et al. have already found jil, Isl, the coexistence of additive 
and multiplicative stochastic processes is allowed. By assuming that these 
processes are uncorrelated, we get 

A{m) = Aq + am, 

B{m) = B^ + bm^ = b{ml + m^), (9) 

where m^ = Bo/b. This consideration leads (together with Eq. (|1])) to a 
significant Yakovenko et al. model with the probability distribution function 
given by 

g— (mo /T) arctan(m/r?io) 
Peaim) = const ; (10) 

^ [1 + (m/mo)2]("+i)/2 ^ ^ 

where parameters a and T are defined above. For m <^ mo, Eq. f fTOj) becomes 
the Boltzmann-Gibbs law while for m ^ mo it becomes the weak Pareto law. 
Notably, the mo parameter is the crossover income between ranges of additive 
and multiplicative processes. 
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2.2. Our extension 

Based on the Yakovenko et al. Eq. ( ITOj) . the complementary cumula- 
tive distribution function can be used to describe income of only low- and 
medium-income society classes. However, it does not capture that of the 
most intriguing high-income society class. Therefore, the goal of our present 
work is to derive from Eq. (jl]) such a distribution function, which would cover 
all three ranges of the empirical data records, i.e. low-, medium-, and high- 
income classes of the society (including also two short intermediate regions 
between them). 

The high-income society class is mainly that composed of the company 
owners. Hence, besides the weak Pareto law, we expect that their household 



incomes obey (to a good approximation) the Zipf law |34j-l36| (the Zipf law 
is the Pareto law with the exponent a = 1). In order to derive the Zipf law 
from Eq. (jl]), we have to provide, therefore, functions A{m) and B{m) in 
the threshold form: 

^j-^) = / = ^o + o"^, ifm<mi 

1 A-(m) = A'r. + a' m, if m > mi 



^(m) = Bq + him? = b (ttIq + m^) 



B 

- ) if m < mi , . 

i^Kjn)- \ ^>^^^ = B'^\ b'm^ = b'{m'^ + m^) ^^^> 

if m > mi 

where mg = Bo/b and tuq = B'^/b' . The threshold parameter mi can be 
interpreted as a crossover income between the medium- and high-income 
society classes. Remarkably, both income crossovers mo and mi(> mo) are 
exogenous parameters. They should be determined from the dependence of 
the empirical complementary cumulative distribution function on variable m 
because both crossovers are sufficiently distinct (see below for details). 

Apparently, we assumed above that the formalism of the income change 
is the same for the whole society. This formalism is expressed by the thresh- 
old nonlinear Langevin equation where particular dynamics distinguishes the 
range of the high-income society class from those of the others. 

For protection of the equilibrium distribution function against disconti- 
nuity at the threshold mi (which means adoption of the continuity principle 
of the equilibrium distribution function of household incomes being a kind 
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of Ockham's razor principle), the following requirement should be satisfied: 

P<(m = mi) = P|(m = mi) (13) 



where 



and 



^ const ( A<{m') , , , 



B^{m) \ J^.^^.^ B{m 



const [ pi A<{m') 



dm! 



nit 



-L iMn- '''' 

By substituting Eqs. f|T4|) and f lTSj) into Eq. flT3|) . we directly obtain 
B^{m = mi) = B-{m = mi) 

To assure that the interpretation of the parameter mg is consistent with the 
income crossover mo, we further put 

m[,=mo^^ = f. (17) 

Moreover, in accordance with Eq. (fT7|) . we make even more rigorous assump- 
tions 

B'q = Bo and b' = b. (18) 



Subsequently, by substituting Eqs. ( ITTi) and ( |T2l) into Eqs. ( |T4l) and ( |T5l) . 
we finally get 

{„/ exp(-(mo/r) arctan(m/mo)) if rn ^ rr, 

^ |l+(m/mo)2](-+i)/2 ' 1I"^<'^1 , - 

„ cxp(-(m.o/Ti) arctan(m/mo)) \f ^ m ^ ' 

^ [l+(m/mo)2]("i + i)/2 ' II ^ "^1 
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where ai = l + a'/b and Ti = Bq/Aq while c' and c" are mutually related con- 
stants. These constants are proportional to the normalisation factor const. 
Besides, constant c' depends on minit, ttiq, T, and a while c" additionally 
depends on Ti, mi, and ai. Apparently, the number of free (effective) pa- 
rameters driving the two-branch distribution function, Eq. f|T9|) . is reduced 
because this function depends only on the ratio of the initial parameters 
defining the Langevin dynamics ([T]). 

For mi 3> mo, the interpretation of the distribution function, Eq. (fT9|) . 
is self-consistent, as required, because the two power-law regimes are well 
defined. Then, for instance for m ^ mo, the second branch in Eq. (fT9|) be- 
comes the power-law dependence driven by the Pareto exponent ai different 
(in general) from a. 

Importantly, our analysis indicates that the existence of the third income 
region is already allowed by theory. We are following this indication below. 

3. Results and discussion 

In principle, we are ready to compare the theoretical complementary cu- 
mulative distribution function based on our probability distribution function 
-Pcq(^), given by Eq. ( !T9|) . with the empirical data for the whole income 
range. However, the analytical form of this theoretical complementary cu- 
mulative distribution function is unknown in the closed explicit form. There- 
fore, we calculate it numerically. The key technical question arises on how 
to fit this complicated theoretical function to the empirical data. The fitting 
procedure consists of three steps as, fortunately, all parameters are to be 
found (in principle) by using independent fitting routines, as follows. 

In the initial step, we found rough (more or less) approximated values of 
crossovers mo and mi directly from the plot of the empirical complementary 
cumulative distribution function (or empirical data). Thus, uncertainty of the 
mo and mi parameters did not exceed 10%, which was sufficiently accurate. 
Moreover, we took the exact value of the parameter minit as the first point 
in the record of the empirical data. 

Secondly, we determined the temperature T value by fitting the Boltzmann- 
Gibbs formula, Eq. (E]), to the corresponding empirical data in the range 
extending from minit to mo (both found in the initial step). Notably, we as- 
sumed that this formula could be characterised by a single temperature value 
since the society as a whole was considered to be in (partial) equilibrium dur- 
ing the whole fiscal year. That is, we further put Ti = T Aq = Aq. 
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At the third step, we determined exponents a and a\ by separately fitting 
the weak Pareto law to the empirical data for the medium- and high-income 
society classes, respectively. 

The results obtained in these three steps are correspondingly presented 
in Figs. [1] and [2], mainly in the log- log scale (only in Fig. [1] the inserted plot 
is presented in the log- linear scale). 




m 



Figure 1: Fit of the (exponential) Boltzmann-Gibbs law, Eq. ®, (solid line) to empirical 
data (dots) of the EU low- income society class for year 2008 [27|, [28 1 in the log-log scale, 
for parameters minit = 0.01 EUR, toq = 1-40 x 10^ ± 0.14 x 10^ EUR and T = 34902 ± 3. 
The inset shows the fit in the log-linear scale. 



The plot in Fig. [T] shows the complementary cumulative exponential 
distribution function, i.e. the Boltzmann-Gibbs law, Eq. (jH]), which quite 
well describes the EU low-income society class. This finding significantly 
supports universal applicability of the Boltzmann-Gibbs law in economy. 

Subsequently, the plot in Fig. [2] was constructed. It quite well describes 
the EU medium-income society class by the weak Pareto law. Apparently, 
by joining the Forbes empirical database concerning an effective income of 
billionaires with the EU-SILC database, we found that the Pareto (effective, 
nonuniversal) exponent increased from a = 2.28 ± 0.01 to a = 2.902 ± 0.002. 
This result defines the range of Pareto exponents. This range covers, e.g. 
the exponent a = 2.67 obtained very recently for the medium-income soci- 
ety class in Romania for 2008 by considering a voluminous social security 
database 37|. However, this database contains only empirical data for low- 
and medium- income society class (in our terminology). In principle, it would 
be also possible to join this voluminous database with the Forbes correspond- 
ing dataset if Romania billionaires are present as members of the Forbes 
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rank. As a result of our joining, the range of the medium- income society 
class became much narrower and shifted to incomes reduced by one order of 
magnitude. The medium-income society class is so sensitive to the size of 
the high-income society class as the former contains only no more than 3% 
of all households. 

These results are significant as they demonstrate a crucial role of two 
income society classes in the society structure, that is the low- and the high- 
income society classes. 
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Figure 2: Curves 1 and 2: the fit of the weak Pareto law, Eq. (|5]), (solid line) to empirical 
data (dots) of the EU medium-income society class for year 2008, for parameters uiq — 
1.40 X 10^ ± 0.14 X lO'^ EUR, mi = 4.0 x 10^ ± 0.4 x 10^ EUR and a = 2.902 ± 0.002 if 
the Forbes database is included (curve 2) or a = 2.28 ± 0.01 if the Forbes database is not 
included (curve 1). Curve 3: the fit of the weak Pareto law, Eq. ([8|), to empirical data 
(dots) of the EU high-income society class for year 2008, for parameter ai = 0.79 ± 0.01 



Remarkably, we fitted again the weak Pareto law ([8]) to the so completed 
empirical data by taking into account only the high-income society class (see 
plot in Fig. [2]). Again, this class is well described by the weak Pareto law, 
however, driven by the exponent ai slightly lower than 1.0 (cf. caption to 
Fig. [2]). This result was expected, as the high- income households belonging 
to the high-income society class are usually the owners of companies, whose 
profits are described, indeed, by the Zipf law. 

Importantly, a power-law distribution reveals special property for expo- 
nent ai < 1. That is, the first and higher moments (here moments of income) 
diverge. This means that the proper approach should apply quantiles (which 
are always finite 38|) instead of expectation values. For instance, the median 
should be used instead of the mean value. Anyway, the moment estimates 
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are always finite and can be calculated directly from the empirical data. 
Moreover, there is no characteristic (physical) scale in the high-income soci- 
ety class as the mean value diverges. That is, the (hierarchical, self-similar 
in a probabilistic sense) income structure of the high-income society class 
is scale free making all levels of the structure equivalent. In other words, 
the same dynamical rules apply across the entire high-income society class 
independently of the particular income of different households (belonging to 
the high-income society class) (isf . 

To complete our analysis, we calculated the Pareto exponent for the high- 
income society class by using an alternative approach. We consider the rank 
of the difference between the wealth of billionaires in successive years (here 
year 2008 and 2007 as well as 2006 and 2005). These ranks, i.e. straight lines 
in the log-log scale, are plotted in Fig. [31 The slopes of these lines (equal 
to — ttrank) wcrc Calculated using a fitting routine. The inverse of Orank gives 
the Pareto exponent apareto- 
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Figure 3: Ranks of incomes of billionaires in the EU. Solid lines were obtained by fit- 
ting straight lines (in the log-log scale) to empirical data (dots) for year 2008 (aparcto = 
l/ttrank = 1/1-22 — 0.82 ± 0.02) and to empirical data (squares) for year 2006 (aparcto = 
l/ara„k = 1/1.07 = 0.93 ±0.04) 



Apparently, our calculation of Pareto exponent by using two independent 
methods gives almost the same results (oj = 0.79 ± 0.01 and apareto = 
0.82±0.02) for year 2008, as expected [23|,l35|], suggesting that these methods 
are mutually consistent. We show below that the analogous conclusion is 
fulfilled also for year 2006 (cf. Figs. [3] and H]) as exponents ai and apareto 
are almost equal (ai = 0.90 ± 0.04 and apareto = 0.93 ± 0.04). 

Hence, we have already obtained all values required by the extended 
Yakovenko et al. formula, Eq. flTIJl) . The corresponding plots of the empirical 
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and theoretical complementary cumulative distribution functions in the log- 
log scale are compared in Figs. S] and for years 2006 and 2008, respectively. 
Apparently, the predictions of the extended Yakovenko et al. formula, Eq. 
f|T9|) . (solid curves in Figs. [Hand [5]) well agree with the empirical data (dots 
in Figs, m and [5])011. That is, the extended Yakovenko et al. model well 
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Figure 4: Fit of the complementary cumulative distribution function, based on the ex- 
tended Yakovenko et al. formula, Eq. (|19l) . (solid line), to the EU household in- 
come empirical data (dots) for year 2006 (Ti = T2 = T = 43 x 10^ ± 1 x 10^ EUR, 
Too = 1.20 X 10^ ± 0.12 X 10^^ EUR, toi = 3.70 x 10^ ± 0.37 x 10^^ EUR, a = 3.171 ± 0.002 
, ai = 0.90 ± 0.04 ). The first and the second vertical line represents toq and toi 
respectively. 



describes the empirical complementary cumulative distribution functions of 
the household incomes in the EU for all society classes, i.e. for the low-, 
medium-, and high-income society class. These successful descriptions re- 
sult from the sufficiently realistic assumptions of the model adopted herein. 
These assumptions allow for coexistence of additive and multiplicative pro- 
cesses as well as for the continuity principle of the equilibrium distribution 



^The value of the income temperature T, obtained from the fit of the Boltzmann-Gibbs 
law to the empirical data points for the low-income society class, is 35321 (±4) EUR for 
year 2006. However, the fit of the complementary cumulative distribution function, based 
on the Yakovenko et al. formula, Eq. (fT9| , to all empirical data points is slightly improved, 
as we used the income temperature T higher by ca. 20%. 

^For year 2008, the value of the income temperature T, obtained from the fit of the 
Boltzmann-Gibbs law to the empirical data points for the low-income society class, is 
34902(±3) EUR. However, the fit of the complementary cumulative distribution function, 
based on the Yakovenko et al. formula, Eq. to all empirical data points is slightly 

improved, as we used the income temperature T higher by ca. 10%. 
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Figure 5: Fit of the complementary cumulative distribution function, based on the ex- 
tended Yakovenko et al. formula (|19|) (solid line), to the EU household income em- 
pirical data set (dots) for year 2008 (Ti = Ts = T = 39.5 x 10^ ± 1 x 10^ EUR, 
TOO = 1.40 X lO'^ ± 0.14 X 10^ EUR mi = 4 x 10^ ± 0.4 x 10^ EUR, a = 2.902 ± 0.002, 
ai = 0.79 ± 0.01). The first and the second vertical line represents toq and toi 271 128|, 
respectively. 



function of the household incomes to be obeyed. 
4. Conclusions 

Herein, we proved that the household incomes of all society classes in the 
EU can be modelled by the nonlinear threshold Langevin dynamics ([T]) with 
m-dependent drift term, A{m), and m-dependent dispersion, B{m), given 
by Eq. ffTTl) and f|T2|) . respectively. At the threshold mi, there is a jump of 
the proportionality coefficient of the drift term. That is, this term abruptly 
changes from a to a', where a' < a (as ai < a). It means that the stochastic 
term in Eq. ([T]) is relatively more significant in this case (i.e. the above 
threshold mi) than the drift term. That is, economic activity of the high- 
income society class is much more risky than activities of all other society 
classes, as expected. 

By comparing results obtained for years 2006 and 2008 (cf. Figs. H] and 
[5]), we found that the threshold mi was only slightly higher for the latter 
year than for the former. It means that in year 2006, to the high- income 
society class belonged the society members almost as rich as those belonging 
to this class in year 2008. Moreover, the result of only slightly more extensive 
society stratification in year 2008 was confirmed by exponents' inequality, as 
the exponent ai for year 2006 is only slightly higher than that for year 2008. 
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Furthermore, the slowly-varying tendency (similar to that considered above) 
is also observed for the medium-income society class. In fact, it is surprising 
how stable with respect to the recent financial crisis the shape of the curve 
n(m) vs. m is. That is, only the number of households belonging to a given 
income society class most likely changed but the income structure of the 
society as a whole was not altered. 

The completed database, which we used (by properly joining the Forbes 
empirical database with that of EU-SILC), emphasises a significant role of the 
high-income society class. Namely, the presence of the third region increases 
this Pareto exponent which characterises the medium-income society class 
making the range of this class narrower and shifting it to incomes lower by 
one order of magnitude (cf. Fig. H]). This latter society class is now so much 
reduced that it occupies almost an intermediate region between low- and 
high-income society classes. Apparently, the role of the low- and medium- 
income society classes was, in the present case, significantly reduced. 

The use of two different datasets (EU-SILC and Forbes), which are not 
necessarily compatible, carries a methodological danger of getting discontin- 
uous fits. Nevertheless, bringing these two datasets together and analysing 
them jointly is better for making progress in this field then avoiding their 
comparison. 

Herein, we succeeded in comparing ratios (i.e. relative population of 
successive income society classes) of ri = and r2 = 

for both year 2006 and 2008 by using our formula, Eq. (JT9l) . Hence, we 
determined ri = 32.66 and r2 = 16.48 for year 2006 as well as ri = 48.98 
and r2 = 13.97 for year 2008. We obtained information on, relatively, how 
many society members belong to a given income society class. Apparently, 
population of the medium-income society class is strongly decreased in year 
2008 in comparison to that in year 2006. Several members of this class 
were shifted both to the low- and to high-income society classes. Our low- 
parameter approach seems to be much more sensitive than that using the 
Gini coefficient (G) [i^ because we obtained G = 54.34 and G = 54.89 for 
year 2006 and 2008, respectively. 

Furthermore, we estimated the percentage breakdown of population of 
the society classes: for year 2006 - low-income: 96.85%; medium-income: 
2.97%; high-income: 0.18% and for year 2008 - low-income: 97.86%; medium- 
income: 2.00%; high-income: 0.14%. These results can be considered as 
complementary to that (obtained above) corresponding to the relative pop- 
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ulation of successive income society classes. Interestingly, the total fraction 
of the medium- and high-income classes in the EU was around 3% in year 
2006, which is about the same as that found by Yakovenko et al. jl] for the 
US, and this fraction has decreased to around 2% in year 2008, most likely 
due to the financial crisis. 

Economists often argue that economic and political conditions are quite 
different in the US and in the EU, and expect a lower income inequality in 
the EU. However, we demonstrate quantitatively herein that the exponential 
law does apply to the EU as well. This finding gives much stronger support 
for universal applicability of the Boltzmann-Gibbs law in economics. 

Our work shows that the income distribution in the low-income class, 
covering around 97% of population, follows the exponential law, whereas in 
the two upper-income classes the distribution follows two power laws with 
different exponents. Remarkably, the role of the medium-income society class 
is strongly reduced making it an intermediate one within our approach to the 
complementary cumulative distribution function. 
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